A Linear Predictive Front-end Processor for Speech Recognition in Noisy Environments

نویسنده

Yariv Ephraim

چکیده

We investigate the performance of a recent algorithm €or linear predictive (LP) modeling of speech signals, which have been degraded by uncorrelated additive noise, as a front-end processor in a speech recognition system. The system is speaker dependent, and recognizes isolated words, based on dynamic time warping principles. The LP model for the clean speech is estimated through appropriate composite modeling of the noisy speech. This is done by minimizing the Itakura-Saito distortion measure between the sample spectrum of the noisy speech and the power spectral density of the composite model. This approach results in a “filtering-modeling” scheme in which the filter for the noisy speech, and the LP model for the clean speech, are alternatively optimized. The proposed system was tested using the 26 word English alphabet, the ten English digits, and the three command words, “stop,” “error,” and “repeat,” which were contaminated by additive white noise at 5-20 dB signal to noise ratios ( S N R ’ s ) . By replacing the standard LP analysis with the proposed algorithm, during training on the clean speech and testing on the noisy speech, we achieve an improvement in recognition accuracy equivalent to an increase in input SNR of approximately 10 dB.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Recognition in Noisy Environment Using Different Feature Extraction Techniques

In this paper, different feature extraction methods for speech recognition system such as Melfrequency cepstral coefficients (MFCC), linear predictive coefficient cepstrum (LPCC) and Bark frequency cepstral coefficients (BFCC) are implemented and the comparison is done based on average recognition accuracy. We suggest a noise robust isolated word speech recognition system which can be applied i...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

A Robust Front-End Processor combining Mel Frequency Cepstral Coefficient and Sub-band Spectral Centroid Histogram methods for Automatic Speech Recognition

Environmental robustness is an important area of research in speech recognition. Mismatch between trained speech models and actual speech to be recognized is due to factors like background noise. It can cause severe degradation in the accuracy of recognizers which are based on commonly used features like mel-frequency cepstral co-efficient (MFCC) and linear predictive coding (LPC). It is well u...

متن کامل

A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments

An acoustic front-end for robust automatic speech recognition in noisy and reverberant environments is proposed in this contribution. It comprises a blind source separation-based signal extraction scheme and only requires two microphone signals. The proposed front-end and its integration into the recognition system is analyzed and evaluated in noisy living room-like environments according to th...

متن کامل

Simultaneous speech recognition in noisy reverberant environme

In this paper, we examine the robustness of a Blind Signal Separation (BSS) technique in the time domain, based on a recurrent neural network, for separating multiple competing speakers in real reverberant environments. The separation network’s learning rule is based on the Maximum Likelihood Estimation criterion and was tested in real room situations in a noise-free and a noisy reverberant env...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2002

A Linear Predictive Front-end Processor for Speech Recognition in Noisy Environments

نویسنده

چکیده

منابع مشابه

Speech Recognition in Noisy Environment Using Different Feature Extraction Techniques

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

A Robust Front-End Processor combining Mel Frequency Cepstral Coefficient and Sub-band Spectral Centroid Histogram methods for Automatic Speech Recognition

A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments

Simultaneous speech recognition in noisy reverberant environme

عنوان ژورنال:

اشتراک گذاری